Update tolerance of MemoryOverconsumptionMpiCheck #206

Merged
merged 3 commits into from
Jul 5, 2024

Collaborator

@lucamar lucamar commented Jun 7, 2024

The test MemoryOverconsumptionMpiCheck often fails to meet the expected performance with PrgEnv-cray and PrgEnv-gnu on Pilatus with CPE 23.12 and COS 3.0.
I suggest increasing the tolerance to 10% below the reference value, which will make the test pass on Pilatus as well.
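
For illustration, the effect of widening the lower tolerance can be sketched in plain Python. This is a minimal sketch, not the check's actual code: the function name `within_tolerance`, the reference value of 100.0, the measured value of 93.0, and the previous bound of 5% are all hypothetical. It only assumes that the lower bound is expressed as a negative fraction of the reference value.

```python
def within_tolerance(measured, reference, lower=-0.10, upper=None):
    """Return True if `measured` lies within the relative bounds around `reference`.

    `lower` and `upper` are fractions of `reference` (lower is negative);
    passing None disables that bound.
    """
    if lower is not None and measured < reference * (1 + lower):
        return False
    if upper is not None and measured > reference * (1 + upper):
        return False
    return True


# Hypothetical numbers, chosen only to show the effect of the wider bound.
reference_value = 100.0
measured_value = 93.0

# With a (hypothetical) 5% lower bound the measurement is rejected ...
print(within_tolerance(measured_value, reference_value, lower=-0.05))  # False
# ... while a 10% lower bound accepts it.
print(within_tolerance(measured_value, reference_value, lower=-0.10))  # True
```

With a lower bound of 10%, any measurement of at least 90% of the reference value passes, so small system-dependent drops in available memory no longer fail the check.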

@lucamar lucamar added the bugfix label Jun 7, 2024
@lucamar lucamar requested review from jgphpc and ekouts June 7, 2024 10:10
@lucamar lucamar self-assigned this Jun 7, 2024
Collaborator Author

lucamar commented Jun 10, 2024

The testing stage on Dom and Piz Daint failed with the following error:

ERROR: you must specify -C with one of the following: mc,gpu,ssd
sbatch: error: cli_filter plugin terminated with error

Collaborator Author

lucamar commented Jul 3, 2024

The Testing phase of the CI is failing on Dom and Piz Daint with the error message:

ERROR: you must specify -C with one of the following: mc,gpu,ssd
sbatch: error: cli_filter plugin terminated with error

The failure of SlurmQueueStatusCheck on Eiger is due to some unavailable compute nodes.

Collaborator Author

lucamar commented Jul 4, 2024

I have submitted SD-61887 on behalf of a user about the reduced memory available on compute nodes after the system upgrade.

Collaborator

@jgphpc jgphpc left a comment

More details are in SD-61887.

Collaborator

jgphpc commented Jul 5, 2024

@jenkins-cscs test this

@jgphpc jgphpc merged commit bc23ec0 into eth-cscs:main Jul 5, 2024
1 of 2 checks passed
@lucamar lucamar deleted the update-MemoryOverconsumptionMpiCheck branch July 5, 2024 08:46